Query-Based Discovering of Popular Changes in WWW

نویسندگان

  • Adam Jatowt
  • Khoo Khyou Bun
  • Mitsuru Ishizuka
چکیده

This paper presents the method for retrieving and summarizing changes in topics from online resources. Users often want to know what are the major changes in their areas of interest. Usually, change detection applications are based on predetermined sets of web pages. User needs to provide the addresses of web pages in order to receive recent information about occurring changes. Our approach involves creation of dynamic web collection for a given area of user’s interest. Such collection would contain informative and up-to-date resources. Periodically, we monitor the set of pages in search for new textual data and detect significant terms to extract sentences reflecting popular changes within every period. Since many web pages can be static over long time, we propose a method for evaluating how up-to-date a web page is in context of a given topic. Each WWW page is scored according to the frequency and contents of its changes. The most valuable pages form a base for next change summarizations. Additionally we expand the web collection to include new, valuable resources by finding pages, which have similar characteristics to the top-scored pages from the collection.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discovering Popular Clicks\' Pattern of Teen Users for Query Recommendation

Search engines are still the most important gates for information search in internet. In this regard, providing the best response in the shortest time possible to the user's request is still desired. Normally, search engines are designed for adults and few policies have been employed considering teen users. Teen users are more biased in clicking the results list than are adult users. This leads...

متن کامل

Query expansion based on relevance feedback and latent semantic analysis

Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...

متن کامل

Discovering the Context of WWW Pages to Improve the Effectiveness of Local Search Engines

This work proposes a method of searching for information in hypertext systems representing WWW sites. The method is based on the creation of a 2-level index. The first level of the index is related to information located only inside the nodes. The second level of the index relates to information which is not restricted to one node but encompasses a set of related nodes. The second level is base...

متن کامل

Automatic discovery of synonyms and lexicalizations from the Web

The search of Web resources is a very important topic due to the huge amount of valuable information available in the WWW. Standard search engines can be a great help but they are often based only on the presence or absence of keywords. Thus problems regarding semantic ambiguity appear. In order to solve one of them, we propose a new method for discovering lexicalizations and synonyms of search...

متن کامل

Dataset Descriptions for Optimizing Federated Querying

Dataset description vocabularies focus on provenance, versioning, licensing, and similar metadata. VoID is a notable exception, providing some expressivity for describing subsets and their contents and can, to some extent, be used for discovering relevant resources and for optimizing querying. In this poster we describe an extension of VoID that provides the expressivity needed in order to supp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003